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Dedicated  to  Professor  Milan  Randid 


General  Introduction 

highly  Influential  paper  by  Randid  entitled  'On  characterization  of  molecular 
branching'  appeared  in^November,  1975  ifr^he  Journal  of  the  American  Chemical 
Society.^  In  this  paper  a  new  topological  index^^  was  introduced  which  has 
since  become  known  as  the  molecular  connectivity  index.  Topological  indices 
(TIs)  are  graph-theoretical  Invariants  which  are  employed  extensively  as 
mathematical  descriptors  for  a  wide  range  of  molecular  specie^^^^^^^^They  have 
been  used  mainly  for  the  purpose  of  correlating  the  properties  of  molecular  species 
with  their  topological  structure.  The  molecular  connectivity  index  was  originally 
put  forward  with  the  aim  of  characterizing  the  branching  in  alkane  species,  but 
more  recently  has  been  shown  to  have  numerous  applications  in  both  the  physical 
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biologicat  sciences^  To  date,  it  is  the  only  Tl  to  have  had  a  whole  book  devoted 
to  it.^n^his  review  we  shall  focus  on  the  use  of  the  index  in  the  more  biologically 
oriented  sciences,  such  as  pharmacology  and  toxicology.  In  particular,  we  shall, 
discuss  the  prediction  of  biological  activity  in  chemical  species  by  means  of 
this  index.v  Little  mention  will  be  made  here  of  other  applications  of  this  index; 
these  ar^^viewed  comprehensively  elsewhere.®'^ 

The  molecular  connectivity  index  as  originally  presented  by  Randi^^ 
was  designed  to  parallel  the  ordering  obtained  for  sets  of  isomeric  alkane  molecules 
based  on  their  physicochemical  propertieSj^The  index,  which  is  nowadays 
universally  represented  by  the  symbol  X,  was  defmed  in  the  following  way: 


led  in  the  following  way: 


1  (vj  Vj)“i  , 
edges  ■" 


where  vj  and  vj^  are  the  degrees  of  a  pair  of  neighboring  vertices  joined  by  the 
edge  {j,  j_}in  the  chemical  graph  of  the  molecule  under  consideration.  The  index 
was  soon  generalized  by  Randid  and  coworkers to  include  a  variety  of  subgraphs 
of  C  other  than  edges;  these  subgraphs  are  referred  to  as  paths,  clusters, 
path/clusters,  and  chains  and  are  illustrated  in  Figure  1.  The  generalized  form 
of  equation  (1)  may  be  written  explicitly  as: 


-Xt  *  I  Vj ....  Vh+i)":J 

subgraphs  ~  “■ 


or,  more  concisely,  as: 


i!h 

ilxt  *  I  n  , 

k-l  i-i  — 


where  ^  is  the  number  of  edges  in  the  subgraph  of  C  used,  t  is  the  type  of  subgraph 


considered  (see  Figure  1),  nj,  is  the  number  of  subgraphs  of  type  t  having  Ji  edges, 
and  the  index  k  extends  over  all  the  n}^  subgraphs. 

Since  it  is  of  critical  importance  in  making  correlations  involving  biological, 
parameters  to  be  able  to  deal  effectively  with  so-called  heteroatoms,  i.e.  atoms 
other  than  carbon,  in  molecular  graphs,  a  further  generalization  of  the  index 
was  made  by  Kier  and  Hall."*^  This  generalization  provides  a  means  of  taking 
into  account  situations  in  which  molecular  species  contain  heteroatoms;  such 
species  can  be  conveniently  modelled  using  rooted  graphs.^  The  differing  valencies 
which  heteroatoms  will  have  are  allowed  for  by  assigning  values  according  to 
the  formula: 

6  *  Z  -  Nh  ,  (4) 

where  5  is  the  difference  between  the  number  of  valence  electrons  in  the 
heteroatom  and  the  number  of  hydrogen  atoms  to  which  it  attached,  Z  is  the 
number  of  valence  electrons,  and  the  number  of  attached  hydrogen  atoms. 
Some  values  of  for  various  heteroatoms  are  presented  in  Table  1.  Whenever 
valence  corrected  atoms  are  used  in  the  calculation  of  the  molecular  connectivity 
index,  the  index  is  written  with  a  superscript  v  as  X''  to  indicate  this  fact. 

In  the  succeeding  sections  we  document  in  some  detail  the  manifold  correlations 
with  biological  activity  which  have  been  obtained  up  to  the  end  of  1985  using 
several  of  the  various  molecular  connectivity  indices.  The  correlations  span 
a  very  broad  range  of  biological  activities,  and,  for  convenience,  will  be  discussed 
under  the  four  headings  (i)  conscious  state  modification;  (ii)  olfactory  and  gustatory 
stimulation;  (ili)  inhibitory  and  depressant  activity;  and  (iv)  mutagenic  and  toxic 
behavior.  It  is  of  interest  to  note  that  molecular  connectivity  indices  first  began 
to  be  employed  for  correlations  involving  biological  parameters  after  it  was 


Chemical  Croup 


Value  of  6 


-OH  5 

-O-  6 

*0  6 

-NH2  3 

-NH-  4 

-NH  4 

-N-  5 

5N  5 


-Cl 


0.690 


-Br 


0.254 


-I 


0.085 


Table  1.  Values  for  6  for  several  different  chemical  groups. 


discovered  that  the  indices  correlate  well  with  certain  of  the  physicochemical 
properties  of  organic  compounds,  specifically  the  octanoi/water  partition 
coefficient**^  and  the  molecular  polarizability **3  (see  Figures  2  and  3).  These 
two  parameters  were  already  well  known  to  correlate  with  biological 
responses. **^'**5  As  observed  by  Randi4  connectivity  indices  encode  in  a  simple 
manner  information  on  the  bond  environments  of  the  various  atoms  comprising 
the  molecule.  Moreover,  since  the  indices  relate  to  properties  which  are  dependent 
upon  both  the  molecular  volume  and  surface  area,  they  should  effectively  reflect 
both  of  these  geometrical  features  of  a  molecule.^^  This  would  imply  a 
relationship  to  the  atom-bond  (polarization)  and  bond-bond  (dispersion)  behavior 
of  molecules.  It  is  reasonable  therefore  to  expect  that  connectivity  indices  are 
able  to  model  biological  responses  which  are  determined  by  such  forces. 
Interactions  based  only  on  forces  of  this  type  have  been  described  by  Sabijid  and 
Trinajstidas  nonspecific  interactions.^ 

Conscious  State  Modification 

In  this  section  we  discuss  the  correlations  obtained  for  anesthetic  inhalation 
gases,  local  anesthetics,  narcotic  and  hallucinogenic  agents.  Some  of  this  material 
has  already  been  covered  by  previous  reviewers,^"®  but  is  included  here  for  the 
sake  of  completeness.  Throughout  this  paper,  correlation  statistics  will  be  given 
in  terms  of  three  parameters:  r,  the  correlation  coefficient;  s,  the  standard 
deviation;  and  n,  the  number  of  data  points.  For  a  set  of  28  inhalation  anesthetics, 
including  several  halocarbon  species,  a  correlation  of  the  type: 


0.571  Oy''  -  0-638 


(r  «  0.881,  s  »  0.496,  n  =  28)  was  obtained^®  using  the  single  variable  °X'',  where 
p  is  the  effective  anesthetic  pressure  (ATA)  in  atmospheres.  By  inclusion  of 
a  variable  Q|-|  representing  the  charge  on  the  polar  hydrogen  atoms,  a  signficantly 
improved  result  (r  »  0.966,  s  «  0.278,  h  =  28)  was  found.  A  later  study ©n  45 
halogenated  hydroccU’bons  confirmed  these  earlier  results  by  again  showing  that 
both  °x''  and  0|-j  were  necessary  to  achieve  a  good  correlation  (r  »  0.975,  s  = 
0.27,  n  »  45).  A  similar  study^O  on  28  emesthetic  aliphatic  ethers,  however, 
revealed  that  was  the  relevant  variable  for  correlation  with  the  AD50  molar 
concentration.  Linear  correlation  with  ”lx  produced  statistics  (r  «  0.979,  s  = 
0.076,  n  -  28)  which  were  improved  when  parabolic  correlation,  i.e.  the  addition 
of  a  (^xJ^  term,  was  attempted  (r  =  0.986,  s  =  0.063,  n  *  28) 

Barbiturates  and  their  derivatives  were  the  subject  of  investigations  undertaken 
by  Murray  et  al.,21  Bonjean  and  Luu  Duc,22  and  Basak  et  al.23  The  correlations 
were  made  between  the  logarithm  of  the  partition  coefficient  and  the 

correlation  coefficients  ranged  from  0.847  to  0.995.  Use  was  made  of  both  linear 
and  parabolic  correlations,  and  it  is  interesting  to  note  that  '^X^  was  superior 
to  log  p  in  the  correlation  of  the  in  vivo  activity  of  barbiturates.23  Several  studies 
on  hallucinogenic  agents  have  demonstrated  the  great  predictive  potential  of 
the  indices  in  this  domain.  To  date,  studies  have  been  carried  out  on  the 
phemethylamines  (mescaline  analogs),^^  the  phenylisopropylamines  (amphetamine 
analogs), 25/26  3©,^  the  indoleamines  (lysergamide  analogs).27  The  correlations 
of  hallucinogenic  activity  were  made  in  mescaline  units  and  based  on  a  two- 
or  three-parameter  approach.  For  the  amphetamine  analogs,  the  equation  assumed 


logp  -  45.16(3^+  1.288  ®Xp  -  4.298  (■^^)  -  5.592 

AO  ^-1* 


the  form; 


(r  =  0.920,  s  =  0.251,  n  =  23),  where  m  is  the  molar  activity.  This  equation  was 
found  to  be  so  reliable  that  it  was  subsequently  used  to  predict  the  activities 
of  various  other  amphetamines  as  well  as  those  of  mescaline  and  tryptamine 

species.  25 

A  number  of  important  correlations  have  been  established  between  either 
^  X  O'"  3nd  the  local  anesthetic  behavior  or  the  narcotic  activites  of  various 
organic  molecules  toward  living  organisms.  A  significant  correlation,  illustrated 
in  Figure  4,  was  found  by  Kier  et  al.13  between  the  minimum  blocking 
concentration,  MBC,  of  36  local  anesthetics  using  cells  under  standard  conditions 
and  \  as  follows: 


log  MBC  =  3.55  -  0762  ^X 

(r  =  0.983,  s  *  0.39,  n  =  36).  Similar  relationships  were  recorded''^  with  the 
biologically  relevant  properties  of  solvent  cavity  surface  area  and  molecular 
polarizability  of  these  36  molecules.  Whole  animal  narcosis  studies  have  related 
the  narcotic  effects  of  molecules  on  larvae  or  frog  tadpoles  and  are  summarized 
in  Kier  and  Hall's  book.8  These  studies  have  included  correlation  of  the  narcotic 
concentration  of  20  organic  compounds  for  Aren i cola  larvae,®  the  narcotic 
concentration  of  15  alcohols  with  barnacle  larvae,**^  the  effective  concentration 
of  52  organic  compounds  for  tadpole  narcosis,®  and  a  repeat  of  the  latter 
correlation  using  only  36  organic  compounds.^2  |p  these  four  latter  linear 
correlations  the  correlation  coefficients  were  respectively  0.979,  0.987,  0.886 
and  0.956.  The  differing  coefficents  for  tadpole  narcosis  reflect  the  fact  that 
two  different  mechanisms  operate.®  Similar  results  were  reported  by  Murray 
et  al.*'^  who  investigated  the  I -jog  movement  factor  for  both  2.5-day-old  and 
12-day-oId  tadpoles  at  18*C. 


Olfactory  and  Gustatory  Stimulation 

A  satisfactory  basis  for  the  classification  of  various  odorant  molecues  in 
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terms  of  their  shape  and  size  has  been  advanced  by  Amoore.  '  The  molecules 
in  the  various  categories  proposed  by  Amoore^^  have  been  ranked  according 
to  their  degree  of  odor  similarity,  OS.  Kier  et  al.^O  investigated  the  relationship 
of  OS  to  molecular  connectivity  indices  for  four  categories  of  odorants.  The 
results  obtained  were  as  follows: 

For  15  etheral  odorants: 


OS  =  7.47  -  1.84  2x+  1.34  3xc 
(r  =  0.921,  s  *  0.395,  n  =  15). 

For  16  floral  odorants: 

OS  =  3.12e  -l-eecOx  -  9.51)2  ^  3  ^3 

(r  *  0.959,  s  =  0.225,  n  -  16). 

For  14  odiferous  carboxylic  acids: 

logAct  =  3.S34''x  -0.387(\)2  -  S.OV^Xp  -  3.184 


(r  »  0.949,  s  =  0.343,  n  *  14),  where  Acj  is  the  threshold  concentration  of  the 


acid  needed  to  overcome  anosima. 

For  24  substituted  benzaldehydes  and  nitrobenzenes: 

OS  =  15.02  -  2.122  oxv  +  2.424  3 

(r  =  0.937,  s  =  0.60,  n  =  24). 

The  meaningful  correlations  obtained  between  quantitative  odor  parameters 
and  various  connectivity  indices  indicate  that  the  latter  reliably  model 
structure-activity  relationships  for  several  different  categories  of  odorants. 

The  relationships  between  sweet  and  bitter  tastes  and  molecular  structures 
have  also  been  explored  using  connectivity  indices.  Hall  and  Kier^T  studied  a 
set  of  substituted  ritroanilines  which  possess  a  sweet  taste  potency  of  up  to 
4000  times  that  of  sucrose.  By  expressing  sweetness  as  the  relative  molar 
sweetness  with  respect  to  sucrose,  RS,  a  relationship  of  the  following  form  was 
obtained: 

logRS  =  0.350  I X  +  0.694  '•xv  -  3.856 

(r  =  0.953,  s  =  0.222,  n  =  9). 

Another  study3  on  a  set  of  cyclohexyaldoxines  attempted  to  assign  molecules 
to  sweet  or  bitter  classes.  The  assignment  was  made  on  the  basis  of  the  value 
assumed  by  a  linear  discriminant  function  made  up  of  "^X  and  ^Xp: 

y  «  1.21  '•x  -  3.88  4xp. 

When  y  >  -3.27  assignment  was  made  to  the  sweet  class;  when  y  <  -3.27 


assignment  was  made  to  the  bitter  class.  Using  this  criterion  it  was  possible 
to  assign  correctly  9  out  of  10  molecules  to  the  sweet  class  and  8  out  of  10  to 
the  bitter  class.  The  statistical  significance,  stated  as  the  value,  was  5.9. 
Taste  thresholds  were  also  Investigated  by  Gardner33,34  for  various  homologous 
series  of  compounds,  amino  acids  and  peptides  of  importance  in  the  production 
of  beer.  Linear  correlations  with  were  obtained,  with  correlation  coefficients 

lying  between  0.701  and  0.943. 

Inhibitory  and  Depressant  Activity 

A  number  of  studies  have  correlated  the  inhibition  of  enzymes  with  different 
connectivity  indices.  Such  studies  have  included  the  inhibition  of  succinate*^^ 
and  hydrazide  monoamine^^  oxidase,  thymidine  phosphor!  lase,”'^  adenosine 
deaminase, butyrylcholesterinase,^^  and  ribonucleotide  reductase.^^  For 
succinate  oxidase  a  model  using  only  was  presented: 

pC  =  0.916  ^x  -  1.582 

(r  =  0.966,  s  =  0.169,  n  =  13) 

whereas  for  hydrazide  monoamine  oxide  a  three-parameter  model  was  required: 

PI50  *  -5.2  -  29E  -  0.82  2x  +  1.8  3x^ 

(r  =  0.941,  s  =  0.201,  n  =  24), 

where  pC  is  the  concentration  of  115-20  succinate  oxidase,  and  E  is  the  half-wave 


potential.  The  correlation  obtained  for  ribonucleotide  reductase^^  was  also 
a  three-parameter  equation: 

pC  -  2.36  3xp  -  3.98  Ox''  +  0.97  (1x^)2  +  9.20 

(r  «  0.943,  s  >  0.21,  n  »  28).  The  study  suggested  that  the  benzohydroxamic  acid 
inhibitors  used  display  inhibitory  potency  in  two  ways  associated  with  two  different 
parts  of  the  molecule. 

A  congeneric  series  of  quarternary  ammonium  salts  was  employed^**  for 
correlations  of  minimum  inhibitory  concentrations,  minimum  killing  concentrations 
and  minimum  concentrations  for  50%  hemolysis  against  various  microorganisms, 
including  Staphylococcus  typhosa  and  Staphylococcus  aureus.  The  correlations 
all  assumed  a  parabolic  form  using  ^x  and  correlation  coefficients  ranged 

from  0.872  to  0.982.  It  was  conjectured  that  the  correlations  were  due  to  the 
fact  that  ”*x  mirrors  well  the  size  and  shape  of  the  molecules  involved.  Similar 
results  were  reported^**  for  a  set  of  aromatic  and  heterocyclic  amines  using 
the  minimum  inhibitory  concentration  on  Mycobacterium  tuberculosis.  The 
inhibition  of  the  microsomal  p-hydroxylatlon  of  aniline  by  alkanols  was  investigated 
by  SabIJidand  Proti^Sabljid37  They  found  the  best  two-parameter  equation 
to  be  the  following: 

pICso  *  -6.88  (5^)  -  1.14  ^xpc  +  l-SS 

(r  =  0.983,  s  *  0.156,  n  *  20),  and  concluded  that  the  alcohols  bind  at  the 
enzyme-active  site,  with  the  inhibiting  effect  being  the  result  of  a  fine  balance 
between  the  size  and  degree  of  branching  in  the  alkyl  chain. 

The  correlating  equation  between  the  muscarinic  receptor  affinity  of  104 
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optically  inactive  acetylcholine  antagonists  and  three  different  connectivity 
indices  was  found  by  Kier  and  Hall^S  to  be: 

logK  =  0.749  4xpc  +  0.258 '’x''  +  1.340  +  0.827 

(r  =  0.962,  s  =>  0.331,  n  =  104),  where  log  K  expresses  the  receptor  affinity.  A 
check  on  the  reliability  of  the  correlation  was  performed  by  substituting  random 
numbers  for  all  of  the  22  different  indices  considered  and  repeating  the  procedure 
100  times.  The  highest  correlation  coefficient  obtained  in  this  way  was  only 
0.45;  the  relation  thus  reflects  the  systematic  variation  of  log  K  with  molecular 
structure.  Moreover,  the  analysis  revealed  that  the  separate  ends  of  the  molecule 
are  of  greatest  Importance  in  determining  the  observed  activity,  namely  the 
onium  group  and  the  side  chain.  The  contribution  to  log  K  from  the  increasing 
number  of  atoms  in  the  molecules  (reflected  In  was  offset  by  a  decline  In 
together  with  a  levelling  of  the  \pc  values  in  a  set  of  eight  test  onium 
compounds.  When  the  relationship  was  used  to  predict  both  agonist  and  antagonist 
affinities,  very  satisfactory  results  were  obtained. ^8 

The  behavior  of  isatin  derivatives  in  regard  to  their  mitodepressant  activity 
and  binding  to  human  serum  albumin  has  been  investigated  by  Sablid  et  al.^^ 
using  both  and  the  Hilckel  energy  of  the  highest  occupied  molecular  orbital, 
EhoMO*  multiple  correlation  obtained  for  protein  binding  of  16  Isatin 

derivatives  was: 

P.B.  »  1.672  (1x'')2  -  16.924  '•x''  -  131.538  (EhoMO^^ 

+  129.285  EhomO  +  42.155 


with  a  correlation  coefficient  of  0.892.  The  equation  obtained  for  mitodepressant 


1^ ^  ^  Si  ^■t  *-iU*‘rt  *rti  i*  i'i*a  i*a 


«>«*»* ' 


activity  was  the  parabolic  one  with 


log-  =  -  0.060  (‘'x'')2  +  0.884  (“I xV)  -1.710 


(r  =  0.820,  n  3:  26),  though  the  best  multiple  correlation  equation  assumed  the 
form: 

log>J  »  -4.624  (EhomO>^  +  ^•'^51  EhomO 

+  0.126  lx ''  -  -0.253 


(r  =  0.841,  n  =  26).  The  comparatively  low  values  of  the  correlation  coefficients 
were  ascribed  to  the  low  accuracy  of  the  biological  data  used. 

The  antimicrobial  activity  of  phenyl  propyl  ethers  against  skin  fungi  and  the 
antiviral  activity  of  benzimidazoles  against  the  Lee  strain  of  6  influenza  virus 
were  studied  using  connectivity  Indices  by  Hall  and  Kier.^  After  showing  that 
these  two  types  of  activity  correlated  well  (correlation  coefficients  of  0.955 
and  0.966  for  a  three-parameter  and  a  two-parameter  equation,  respectively), 
they  went  on  to  partition  the  significant  regression  variables  into  subgraph  terms 
reflecting  various  structural  features  of  the  molecules  in  question.  This  analysis 
led  to  several  instructive  conclusions  about  the  nature  of  the  principal  interactions 
involved.  For  instance,  it  was  found  that  in  the  phenyl  propyl  ethers  the  principal 
interactions  focus  on  the  para-position  of  the  phenyl  ring;  and  in  the 
benzimidazoles  that  substitution  of  branched  or  cyclic  alkyl  groups  on  the 
2-position  (five-memfaered  ring)  is  important  for  high  activity.  Such  results  are 
clearly  superior  to  earlier  studies  based  on  electronic  factors.  Two  other  studies 
which  set  out  to  compare  the  molecular  connectivity  approach  with  other 
approaches  also  revealed  the  power  of  the  former  approach.^”*»^2  activity 


chosen  in  each  case  was  the  antimicrobial  activity  of  an  identical  set  of  50  chloro-, 
bromo-  and  alky Iphenols  against  Staphylococcus  aureus.  Both  studies  established 
the  substantial  advantages  inherent  in  the  use  of  purely  topological  methods 
to  carry  out  structure-property  correlations,  including  the  extreme  simplicity 
of  the  method  and  the  high  reliability  of  the  correlations  obtained.  A  further 
confirmation  of  the  value  of  molecular  connectivity  indices  was  presented  by 
Samanta  et  al.^^  who  investigated  the  anitfungal  activity  of  12  phenolic  compounds 
and  were  able  to  show  that  such  activity  was  modelled  better  by  use  of  than 
by  the  logarithm  of  the  octanol/water  partition  coefficient. 

Mutagenic  and  Toxic  Behavior 

To  date,  very  little  has  been  attempted  In  correlating  mutagenic  behavior 
with  molecular  connectivity  indices  whereas  toxicity  has  been  fairly  widely 
correlated.  One  reason  for  this  may  be  the  comparative  paucity  of  reliable  data 
on  the  mutagenclty  of  molecules.  A  study  carried  out  by  Kier  et  al.^  on  15 
nitrosamines  known  to  occur  in  cigarette  smoke,  nitrate-pickled  meat,  and  smoked 
fish  correlated  their  mutagenicity  as  revealed  by  the  Ames  test  against  a  variety 
of  connectivity  indices.  Two  different  equations  were  obtained  with  closely 
similar  statistics,  namely: 

logR  *  2.398‘’x  -  4.095  “He  v  -  5.590 


(r  =  0.964,  s  -  1.09,  n  »  15), 
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log  R  =  2.946  2x  -  0.090  4  xV  -  4.662 

<r  »  0.967,  s  *  1.05,  n  »  15),  where  R  is  the  number  of  revertants  per  nanomole. 
The  equations  were  feit44  to  be  good  enough  to  serve  as  screening  devices  for 
untested  nitrosamines.  The  closely  related  property  of  carcinogenicity  was 
investigated  by  Jurs  et  al.45  using  a  large,  heterogeneous  data  set  of  organic 
compounds.  The  objective  was  to  separate  the  set  into  carcinogens  and 
noncarcinogens  using  a  variety  of  descriptors,  including  several  connectivity 
indices.  Pattern  recognition  analysis  demonstrated  that  no  descriptors  were 
completely  effective  here,  the  best  achieving  only  a  90%  correct  separation. 
However,  it  was  concluded^S  that  connectivity  indices  are  appropriate  for  the 
broad  classification  of  compounds  into  carcinogens  or  noncarcinogens. 

A  correlation  coefficient  of  0.9972  was  obtained^S  in  a  one-parameter 
correlation  of  the  toxicity  of  polycyclic  aromatic  hydrocarbons  in  Daphnia  Pulex 
with 

-logLCso  =  0.5346  ^x''  -  7.004 

(r  =  0.9972,  n  =  5),  where  LC50  is  the  lethal  concentration  at  96  hours.  Although 
only  five  values  were  available  for  correlation,  up  to  26  values  were  employed 
for  correlations  of  various  x'^  terms  with  other  hydrocarbon  data.  It  was  shown 
that  chromatographic,  partitioning,  bioconcentration,  and  toxification  processes 
are  closely  intercorrelated;  correlation  coefficients  for  correlations  between 
the  various  data  sets  ranged  from  0.8967  to  0.9989.  Two  studies,  based  on  aliphatic 
hydrocarbons,  ethers  and  ketones,47  and  substituted  arylamines,^  Intercorrelated 
the  anesthetic  and  toxic  activities  of  these  compounds.  The  equations  obtained 
were  two-parameter  equations  based  on  °x'^/  n  o'"  again,  very 

r  r 


good  correlations  were  established  between  anesthetic  and  toxic  activities.  Basak 
et  al.^^  used  IS  industrially  important  esters  in  a  correlation  of  the  LC50  values 
in  Pimephales  Promelas  with  ^ind  The  best  single  parameter  for  such 

prediction  was  though  a  multiparametric  relationship  based  on  the 

octanol/water  partition  coefficient  and  a  steric  parameter  proved  to  yield  the 
most  significant  correlation. 

A  number  of  studies  have  been  undertaken  of  the  toxicity  of  compounds 
containing  either  chlorine  or  nitrogen  heteroatoms.  Sabljid^O  investigated  the 
toxicity  of  19  chlorinated  aromatic  and  aliphatic  hydrocarbons  and  phenols  toward 
the  sheepshead  minnow  Cyprinodon  variegatus.  The  best  single-parameter 
correlation  was  obtained  for  though  ”^x  ^nd  ^  also  gave  reasonable 
correlations.  The  derived  equations  account  for  no  more  than  84-88%  of  the 
variation  in  the  toxicity  data;  this  is  hardly  surprising  in  view  of  the  low  accuracy 
of  these  data.  A  set  of  chlorinated  phenols  and  a  miscellaneous  set  of  organic 
compounds  containing  halogenated  and  nitrated  products  were  used  by  Koch^l 
for  correlations  of  their  toxicity  to  a  variety  or  organisms,  including  rats  and 
guppies,  with  connectivity  indices.  The  best  result  was  from  the  correlation 
of  the  LC50  concentration  of  chlorophenols  using  guppies: 

logLCso  *  3.257  -  0.6719  '^x'' 

(r  “  0.982,  n  *  10).  The  general  conclusion  reached  was  that  the  correlation 
coefficients  depend  on  the  organism  used  and  the  similarity  of  the  structure 
of  the  chemical  compounds  involved,  with  closely  similar  compounds  yielding 
the  best  results.  Various  nitrogen-containing  heterocycies  were  investigated 
by  Schultz  et  al.52  for  correlations  of  their  toxicity  toward  axenic  cultures  of 
the  freshwater  ciliate  Tetrahymena  pyriformis.  A  better  correlation  with  "'x'' 
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was  obtained  for  six-membered  heterocycles  than  that  for  five-membered 
heterocycles  (r  *  0.988  as  against  0.966).  Elucidation  of  the  effect  of  structure 
on  toxicity  appears  to  be  possible  from  an  analysis  of  the  correlations  found. 

The  molecular  structure  of  hydroxylated  compounds,  namely  alcohols  and 
phenols,  as  reflected  by  connectivity  indices,  also  seems  to  be  closely  related 
to  their  toxicity.  Kier  and  Hal|53  used  the  indices  of  a  set  of  20  mixed  alcohols 
to  correlate  with  the  haemolysis  of  erythrocytes,  the  inhibition  of  the  Tubifex 
worm  movement,  fish  narcosis,  and  the  LD50  dose  in  mice.  One-parameter 
equations  gave  a  fair  account  of  the  variation  of  potency  with  molecular  structure 
in  all  cases,  suggesting  that  the  heteroatom  plays  a  more  or  less  constant  role 
in  these  biological  processes.  A  set  of  25  substituted  phenols  was  employed  by 
Hall  and  Kier^**  for  a  toxicity  correlation  using  the  fathead  minnow  Pimephales 
promelas.  The  best  single-parameter  equation  was  found  to  be: 


phCso  =  1.079  3xV  +  2.528 


(r  *  0.903,  s  =  0.347,  n  =  25);  the  best  two-parameter  equation  (using  \  and 
improved  r  to  0.934.  The  equations  were  considered  good  enough  to  predict 
unknown  phenol  toxicities.^^ 

Organotin  compounds  have  been  the  subject  of  two  recent  studies  by  Laughlin 
et  al.^^  and  Vighi  and  Calamari.56  In  the  former  study  the  LC50  values  of  eight 
triorganotin  compounds  for  the  crab  zoeae  Rhithropanopeus  harrisii  were 
correlated  with  a  variety  of  different  parameters.  Although  connectivity  indices 
were  not  employed,  it  was  evident  that  the  best  correlations  were  obtained  from 
parameters,  such  as  molar  surface  areas,  which  can  be  reliably  modelled  using 
topological  indices.  In  the  latter  study  explicit  use  was  made  of  "*x  snd  *'x'^ 
in  characterizing  the  toxicities  of  15  organotin  compounds  against  Daphnia  magna. 
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This  study  confirmed  that  molecular  topology  can  be  employed  to  model  the 
behavior  of  organotin  compounds.  With  a  single-parameter  equation  the  best 
correlation  obtained  was: 

log(-^^)  *  0.749 '•x''  -  S.63 

(r  »  0.925,  n  =  12),  where  EC50  is  the  effective  concentration  of  organotin  which 
is  lethal  to  50%  of  the  organisms.  The  best  overall  correlation  was  obtained 
using  a  triparametric  approach  based  on  lipophilic  (log  P),  electronic  (pK^)  and 
steric  (^x)  characteristics;  this  raised  the  correlation  coefficient  to  0.989. 

Some  Miscellaneous  Applications 

In  this  section  brief  mention  is  made  of  a  number  of  publications  which  have 
some  bearing  on  the  prediction  of  biological  behavior  in  chemical  species.  The 
way  in  which  connectivity  indices  are  related  to  molecular  volume  was  investigated 
by  Hall  and  Kier,57  and  the  capacity  of  the  Indices  to  model  lipophilicity  was 
studied  by  Basak  et  al.^S  The  use  of  connectivity  indices  in  the  evaluation  of 
environmental  pollutants  has  been  discussed  by  Koch.^^  Connectivity  indices 
have  also  found  application  in  both  discriminant  analysis^^  and  principal  component 
analysis. structural  information  which  molecular  connectivities  indices 
encode  has  been  the  subject  of  several  papers.^^"^^  The  extent  to  which  the 
indices  can  model  steric  phenomena  has  also  been  explored  by  several  authors.®^"®® 
The  possiblity  of  obtaining  good  correlations  in  multiple  regression  analyses  by 
chance  has  been  ascertained  by  Topliss  and  Costello,®^  and  shown  to  be  negligible 
for  the  great  majority  of  the  studies  described  herein. 
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Figure  1.  Examples  of  the  four  different  types  of  subgraphs  used  in  the  calculation 
of  molecular  connectivity  indices. 

Figure  2.  The  correlation  of  logarithm  of  the  octanol/water  partition  coefficient 
against  ”*x  a  mixture  of  138  esters,  carboxylic  acids,  alcohols,  amines,  ketones 
and  ethers.  [Based  on  W.J.  Murray,  L.H.  Hall  and  L.B.  Kier,  J.  Pharm.  Scis  ^ 
(1975)  1978.] 

Figure  3.  The  correlation  of  molecular  polarizability  against  \  for  36  different 
compounds  used  in  a  nonspecific  anesthesia  study.  [Based  on  L.B.  Kier,  L.H.  Hail, 
W.J.  Murray  and  M.  Randi<i;  J.  Pharm.  Scls^(1975)  1971.] 

Figure  4.  The  correlation  of  logarithm  of  the  minimum  blocking  concentration 
against  "*x  ^'or  36  different  compounds  used  in  a  nonspecific  anesthesia  study. 
[Based  on  L.B.  Kier,  L.H.  Hall,  W.J.  Murray  and  M.  Randid,  J.  Pharm.  Scis  ^ 
(1975)  1971.] 


SUBGRAPH  EXAMPLES 


Chain 


